ABSTRACT


Independent time-series data, consisting of white noise with different
degrees of oversampling, have been used to study the effect of oversampling
on the time-adaptive prediction filter. If the data are oversampled, false
gain occurs in the adaptive prediction results. The false gain depends on
the degree of oversampling, the number of channels used in making the
prediction, the filter length, and the convergence parameter.

Two adaptive algorithms, one having a constant convergence parameter and
the other having a variable convergence parameter, are discussed in this
report. Particular cases of the prediction mean-square-error function of
the time-adaptive filter are derived and compared to the empirical results.
Although the false gain can be quite severe for high rates of adaption,
rates of adaption can be selected in terms of theoretical maximum rates of
adaption so that the false gain is not significant, even for oversampled
data cut at 1/20 of the folding frequency.
SECTION I

INTRODUCTION AND SUMMARY


Previous work has shown that adaptive filters effectively reduce
problems with time-varying noise fields¹ and that adaptive-filter design can
be achieved with off-line processing to produce filters equivalent in
mean-square-error to conventional Wiener filters.

It is heuristically obvious and has been empirically observed that
oversampled data, combined with a fast rate of adaption, produce false gain,
i.e., an apparent prediction of energy in a desired output that is in fact
statistically independent of the data. This report quantitatively assesses
the expected degree of false gain and determines this phenomenon's
significance in on-line adaptive processes.

The time-adaptive filter considered in this report can be described as
follows. If $Y_j$ is the output of a time-adaptive filter, then

    $$Y_j = W_j^T X_j \qquad (1-1)$$

where $W_j$ is the column vector of the filter at the $j$th adaptive
iteration and $X_j$ is the $j$th sample of the input data. The adaptive
algorithm can be represented as

    $$W_{j+1} = W_j + 2 k_s \epsilon_j X_j \qquad (1-2)$$

where

    $$\epsilon_j = d_j - Y_j \qquad (1-3)$$

and

    $k_s$ is a positive convergence parameter
    $d_j$ is the $j$th desired response

The mean-square-error of the iteration is defined as

    $$\overline{\epsilon_j^2} = \overline{(d_j - Y_j)^2} \qquad (1-4)$$

where the overbar denotes the mean value.
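As a minimal sketch of Equations 1-1 through 1-4 (not the program used for this report), the loop below applies the constant-parameter update to a single-channel prediction filter. The function names, the choice of the `length` most recent past samples as $X_j$, and the zero initial filter are assumptions made for illustration only.

```python
def lms_constant(x, d, length, k_s):
    """Run the update of Eqs. 1-1 to 1-3 and return the error sequence.

    x      : input data (list of floats), single channel
    d      : desired response, same length as x
    length : number of filter points (assumed window of past samples)
    k_s    : constant convergence parameter
    """
    w = [0.0] * length                       # assumed initial filter W_1 = 0
    errors = []
    for j in range(length, len(x)):
        x_j = x[j - length:j]                # assumed form of the vector X_j
        y_j = sum(wi * xi for wi, xi in zip(w, x_j))               # Eq. 1-1
        e_j = d[j] - y_j                                           # Eq. 1-3
        w = [wi + 2.0 * k_s * e_j * xi for wi, xi in zip(w, x_j)]  # Eq. 1-2
        errors.append(e_j)
    return errors


def mean_square(values):
    """Sample estimate of the mean-square value used in Eq. 1-4."""
    return sum(v * v for v in values) / len(values)
```

With `k_s = 0` the filter never leaves zero, so the returned errors equal the desired response itself; this is the nonadaptive reference against which any gain is measured.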


This report investigates the effects of filter length, sampling rate,
and number of channels on the false gain for two adaptive schemes: one having
a constant convergence parameter (represented by $k_s$ in Equation 1-2) and
one having a variable convergence parameter (represented by substituting
$b_j$ for $k_s$ in Equation 1-2, where $b_j = b / (X_j^T X_j)$ and $b$ is a
constant).
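The variable-parameter scheme can be sketched as a single update step in which the constant $k_s$ of Equation 1-2 is replaced by $b_j = b / (X_j^T X_j)$. The function name and the handling of a zero-energy input vector (the filter is simply left unchanged) are assumptions, not details taken from the report.

```python
def variable_step(w, x_j, d_j, b):
    """One iteration of Eq. 1-2 with k_s replaced by b_j = b / (X_j^T X_j).

    Returns the updated filter and the error computed before the update.
    """
    energy = sum(xi * xi for xi in x_j)                  # X_j^T X_j
    e_j = d_j - sum(wi * xi for wi, xi in zip(w, x_j))   # Eq. 1-3
    if energy == 0.0:
        # Assumed convention: no update when the input vector is all zeros.
        return list(w), e_j
    b_j = b / energy
    w_new = [wi + 2.0 * b_j * e_j * xi for wi, xi in zip(w, x_j)]
    return w_new, e_j
```

Because the step is scaled by the input energy, the error recomputed on the same sample after the update is $(1 - 2b)\,\epsilon_j$; for $b = 1/2$ the updated filter reproduces $d_j$ exactly on that sample.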


Both schemes have been run on data having uniform but bandlimited
spectra. Single-channel 21-point, single-channel 35-point, 7-channel 3-point,
and 7-channel 5-point filters are each applied to data bandlimited to 1/20 of
the folding frequency, to 1/4 of the folding frequency, and to the folding
frequency itself. Table I-1 shows the experimental cases used. The data are
independent from channel to channel. A complete description of the method for
generating these data is in an earlier Advanced Array Research special
report.²
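The generation method itself is documented only in the earlier report cited above and is not reproduced here. As a stand-in, the sketch below produces data with an approximately uniform, bandlimited spectrum by low-pass filtering Gaussian white noise with a Hamming-windowed sinc filter. The function names, the filter design, the tap count, and the seed are all assumptions.

```python
import math
import random


def bandlimited_noise(n, fraction, half_taps=100, seed=0):
    """Return n samples of unit mean-square noise cut at `fraction` of folding.

    fraction = 1 leaves the noise white; fraction = 1/20 gives data that are
    oversampled by a factor of about 20.
    """
    rng = random.Random(seed)
    m = half_taps
    white = [rng.gauss(0.0, 1.0) for _ in range(n + 2 * m)]
    taps = []
    for i in range(2 * m + 1):
        t = fraction * (i - m)
        sinc = 1.0 if t == 0 else math.sin(math.pi * t) / (math.pi * t)
        window = 0.54 - 0.46 * math.cos(2.0 * math.pi * i / (2 * m))  # Hamming
        taps.append(fraction * sinc * window)
    # The taps are symmetric, so this sliding sum is an ordinary convolution.
    out = [sum(h * white[j + i] for i, h in enumerate(taps)) for j in range(n)]
    rms = math.sqrt(sum(v * v for v in out) / n)
    return [v / rms for v in out]            # normalize to unit mean square


def lag1_correlation(x):
    """Sample correlation between adjacent samples, a rough oversampling check."""
    return sum(a * b for a, b in zip(x, x[1:])) / sum(a * a for a in x)
```

Calling `bandlimited_noise` with different seeds gives independent channels; the lag-one correlation is close to 1 for data cut at 1/20 and close to 0 for data cut at the folding frequency.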
Table I-1

PARAMETER COMBINATIONS


Number of     Filter Lengths     Bandwidth
Channels      (points)           (fraction of folding frequency)

1             21, 35             1/20, 1/4, 1

7             3, 5               1/20, 1/4, 1
SECTION II

PREDICTION-ERROR FUNCTION OF TIME-ADAPTIVE FILTER

The intention was to develop exact formulas or approximations
expressing the mean-square-error as a function of $k_s$, number of channels,
and filter length for each bandwidth; this goal was not satisfactorily
achieved, although certain theoretical results of interest were obtained and
are developed in this section.


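The prediction-error function can be obtained by substituting
Equations 1-1 and 1-3 into Equation 1-2 recursively as follows:

$$
\begin{aligned}
\epsilon_j &= d_j - W_j^T X_j \\
  &= d_j - \left[ W_{j-1}^T + 2 k_s \epsilon_{j-1} X_{j-1}^T \right] X_j \\
  &= d_j - \left[ W_{j-1}^T \left( I - 2 k_s X_{j-1} X_{j-1}^T \right)
       + 2 k_s d_{j-1} X_{j-1}^T \right] X_j \\
  &= d_j - \Big\{ W_{j-2}^T \left[ I - 2 k_s X_{j-2} X_{j-2}^T \right]
       \left[ I - 2 k_s X_{j-1} X_{j-1}^T \right] \\
  &\qquad\quad + 2 k_s d_{j-2} X_{j-2}^T \left[ I - 2 k_s X_{j-1} X_{j-1}^T \right]
       + 2 k_s d_{j-1} X_{j-1}^T \Big\} X_j \\
  &\;\;\vdots \\
  &= d_j - W_1^T \prod_{i=1}^{j-1} \left( I - 2 k_s X_i X_i^T \right) X_j
       - 2 k_s \sum_{l=1}^{j-1} d_l X_l^T
         \prod_{i=l+1}^{j-1} \left( I - 2 k_s X_i X_i^T \right) X_j
\end{aligned}
\qquad (2-1)
$$

where

    $\prod_{i=l+1}^{j-1} \left( I - 2 k_s X_i X_i^T \right) = I$ for $l \ge j - 1$

    $I$ is the unit matrix
    $W_1$ is the initial filter

The factors in the matrix products are assumed to be ordered from left to
right; i.e.,

    $$\prod_{i=l+1}^{j-1} (\cdot)_i = (\cdot)_{l+1} (\cdot)_{l+2} \cdots (\cdot)_{j-1}$$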


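Thus, the mean-square-error can be expressed as

$$
\overline{\epsilon_j^2} = \overline{\left[ d_j
    - W_1^T \prod_{i=1}^{j-1} \left( I - 2 k_s X_i X_i^T \right) X_j
    - 2 k_s \sum_{l=1}^{j-1} d_l X_l^T
      \prod_{i=l+1}^{j-1} \left( I - 2 k_s X_i X_i^T \right) X_j \right]^2}
\qquad (2-2)
$$

If we set $W_1 = 0$ and

$$
a_j = \left[ \sum_{l=1}^{j-1} d_l X_l^T
      \prod_{i=l+1}^{j-1} \left( I - 2 k_s X_i X_i^T \right) \right] X_j
\qquad (2-3)
$$

Equation 2-2 can then be expressed as

$$
\overline{\epsilon_j^2} = \overline{d_j^2} - 4 k_s \overline{d_j a_j}
    + 4 k_s^2 \overline{a_j^2}
\qquad (2-4)
$$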
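The expansion can be checked numerically. The sketch below (function names are assumptions) computes the error at the last sample in two ways: by running the recursion of Equation 1-2 from a zero initial filter, and by evaluating $d_j - 2 k_s a_j$ with $a_j$ formed from the ordered matrix products of Equation 2-3. Each product is applied to a row vector from left to right, so no matrices need to be stored.

```python
def recursive_error(xs, ds, k_s):
    """Error at the last sample from iterating Eq. 1-2 with W_1 = 0."""
    w = [0.0] * len(xs[0])
    for x, d in zip(xs[:-1], ds[:-1]):
        e = d - sum(wi * xi for wi, xi in zip(w, x))
        w = [wi + 2.0 * k_s * e * xi for wi, xi in zip(w, x)]
    return ds[-1] - sum(wi * xi for wi, xi in zip(w, xs[-1]))


def closed_form_a(xs, ds, k_s):
    """a_j of Eq. 2-3 for the last sample, j = len(xs)."""
    last = len(xs) - 1
    total = 0.0
    for l in range(last):                    # l = 1 .. j-1 in the text
        v = list(xs[l])                      # row vector X_l^T
        for i in range(l + 1, last):         # i = l+1 .. j-1, left to right
            # v (I - 2 k_s X_i X_i^T) = v - 2 k_s (v . X_i) X_i^T
            dot = sum(a * b for a, b in zip(v, xs[i]))
            v = [a - 2.0 * k_s * dot * b for a, b in zip(v, xs[i])]
        total += ds[l] * sum(a * b for a, b in zip(v, xs[last]))
    return total
```

For any short test sequence, `recursive_error(xs, ds, k_s)` and `ds[-1] - 2 * k_s * closed_form_a(xs, ds, k_s)` agree to rounding error.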


Equation 2-4 can be simplified in the following special cases.

•  From Equation 2-4, it can be seen that

       $$\lim_{k_s \to 0} \overline{\epsilon_j^2} = \overline{d_j^2}$$

   This corresponds to the nonadaptive case.

•  If $d_j$ and $X_i$ are independent for all $j$ and $i$ and
   $\overline{d_j d_i} = 0$ for $j \ne i$, then, from Equations 2-3 and 2-4,

       $$\overline{\epsilon_j^2} = \overline{d_j^2} + 4 k_s^2 \overline{a_j^2}$$

   increases monotonically as $k_s$ increases. This corresponds to the case
   where the desired output is not oversampled.

•  For $k_s \ll 1$,

       $$k_s \overline{d_j a_j} \gg k_s^2 \overline{a_j^2}
         \quad \text{so that} \quad
         \overline{\epsilon_j^2} \approx \overline{d_j^2} - 4 k_s \overline{d_j a_j}
         \qquad (2-5)$$

   If $d_i$ and $X_j$ are independent for all $i$, $j$, and if both $d_j$ and
   $X_j$ are sufficiently oversampled, it can be shown that
   $\overline{d_j a_j} > 0$. Thus,

       $$\overline{\epsilon_j^2} < \overline{d_j^2}$$

   for $k_s$ in a neighborhood of 0.

•  From Equation 2-4, it follows that, for sufficiently large $k_s$,
   $\overline{\epsilon_j^2}$ increases monotonically.
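The second and third cases can be illustrated with a small simulation. The sketch below is not the experiment of Section III: as a cheap surrogate for oversampled data it uses a first-order autoregressive series whose adjacent samples are strongly correlated, rather than bandlimited noise. The function names, the surrogate model, the seed, and the parameter values are assumptions.

```python
import math
import random


def ar1_series(n, rho, rng):
    """Unit-variance first-order autoregressive series; rho = 0 is white noise."""
    scale = math.sqrt(1.0 - rho * rho)
    x = [rng.gauss(0.0, 1.0)]
    for _ in range(n - 1):
        x.append(rho * x[-1] + scale * rng.gauss(0.0, 1.0))
    return x


def relative_mse(rho, k_s, length=21, n=4096, seed=0):
    """Mean-square-error divided by the mean square of the desired output.

    The desired output d is generated independently of the data x, so a
    ratio below 1 is false gain.
    """
    rng = random.Random(seed)
    x = ar1_series(n, rho, rng)
    d = ar1_series(n, rho, rng)              # independent of x
    w = [0.0] * length                       # W_1 = 0
    err_sq = 0.0
    des_sq = 0.0
    for j in range(length, n):
        x_j = x[j - length:j]
        e_j = d[j] - sum(wi * xi for wi, xi in zip(w, x_j))
        w = [wi + 2.0 * k_s * e_j * xi for wi, xi in zip(w, x_j)]  # Eq. 1-2
        err_sq += e_j * e_j
        des_sq += d[j] * d[j]
    return err_sq / des_sq
```

With `k_s = 0.05 / 21`, `relative_mse(0.99, k_s)` is expected to fall below 1 (false gain on strongly correlated data), while `relative_mse(0.0, k_s)` is expected to exceed 1 (adaptation only adds error when the samples are uncorrelated).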
science services division


SECTION III

EMPIRICAL RESULTS AND CONCLUSIONS


Figures III-1 through III-12 show mean-square-error curves plotted vs
increasing value of the convergence parameter for each configuration of
Table I-1. Independent 4096-point random time-series data having a unit
mean-square value for each channel are used. In each case, the desired output
has the same statistical properties as the data used for prediction but is
independent of the data.

The initial filter has been set to 0 (the optimum filter, since the
desired output is independent of the data) for each run. The initial
transient effect is assumed to be negligible, as indicated by runs made with
16,384-point time-series data for the single-channel 21-point filter.
Different starting points (zeros and peaks) of the predicted time series for
the 21-point and 35-point single-channel filters result in a difference of
less than 1.5 percent in mean-square-error for data cut at 1/20. Therefore,
variation of the starting point does not significantly affect the
mean-square-error curves.

A. CONSTANT CONVERGENCE PARAMETER

Figures III-1 through III-4 show mean-square-error computed for a
constant convergence parameter. As expected, the false gain increases when
the number of filter coefficients is increased by using more channels or
longer filters. There is more false gain when the number of channels is
increased than when longer filters are used. The degree of false gain,
however, is unexpected. As seen in Figure III-4, for the 7-channel 5-point
filter and severe oversampling, a maximum of 92 percent of the energy can be
falsely predicted.

Theoretical approximations to the points of divergence are indicated in
Figures III-1 through III-4. According to Widrow,³ $k_s$ is bounded


k  £ - - - r  A  77  (A  is  defined  equal  to) 

s  I  y  1 2  =  U  - 

max  X . 

1  1 


A simple approximation to this bound results from the assumption of
independence of the components of $X_j$ and the use of

    $$\overline{\left( \frac{1}{Q} \right)} \approx \frac{1}{\overline{Q}}$$


The resulting bound for $k_s$ is

    $$k_s \le \frac{1}{NC \cdot LF \cdot \sigma^2} \triangleq B$$

where $\sigma^2$ is the mean-square-value of any component, NC is the number
of channels, and LF is the length of the filter. Actual points of divergence
$k_s$ satisfied


10  s 


so the approximation is reasonably accurate and, as shown by experience,
can be used for selecting $k_s$ values.
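For the configurations of Table I-1, the approximate bound is a one-line computation. The helper below is a sketch (its name and argument checking are assumptions) that evaluates $B = 1 / (NC \cdot LF \cdot \sigma^2)$ for data with unit mean-square value.

```python
def divergence_bound(num_channels, filter_length, sigma_sq=1.0):
    """Approximate upper bound B = 1 / (NC * LF * sigma^2) on k_s."""
    if num_channels <= 0 or filter_length <= 0 or sigma_sq <= 0.0:
        raise ValueError("all arguments must be positive")
    return 1.0 / (num_channels * filter_length * sigma_sq)


# The four filter configurations of Table I-1, unit mean-square data.
for nc, lf in [(1, 21), (1, 35), (7, 3), (7, 5)]:
    print(nc, lf, round(divergence_bound(nc, lf), 4))
```

Only the total number of coefficients NC x LF enters, so the single-channel 21-point and 7-channel 3-point filters share one bound (about 0.0476), as do the single-channel 35-point and 7-channel 5-point filters (about 0.0286).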

The value of $\sigma^2$ for one corrected channel (consisting of an
individual channel minus the mean across channels) is required for
maximum-likelihood adaptive processing; thus, the use of the preceding
approximation in maximum-likelihood filtering can lead to improper values of
$k_s$ unless the constraint energy is considered.


B. VARIABLE CONVERGENCE PARAMETER


Figures III-5 through III-8 correspond to the cases illustrated in
Figures III-1 through III-4 except for the use, in place of $k_s$, of the
variable convergence parameter

    $$b_j = \frac{b}{X_j^T X_j}$$

Figures III-9 through III-12 show the same data plotted against the mean
value of $b_j$.

In Figures III-1 and III-5 (as well as III-9), curves for $b$ (and mean
value of $b_j$) vary strikingly from those for $k_s$ for the severely
oversampled data. For small values of $b$, the mean-square-error is higher
than it would be if the data were not oversampled! Also note that the
divergence point is extended when using a variable convergence parameter by a
factor dependent on the degree of oversampling.

Radical differences in the mean-square-error curves for the two adaptive
algorithms occur primarily in the single-channel cases. Only slight
differences exist between the results for the two adaptive techniques in the
3-point and 5-point 7-channel filter cases.

This study shows the need for caution in the off-line use (on short data
segments) of adaptive filters which attempt the use of high adaption rates to
avoid multiple passes. Also, it can be concluded that, by using
$k_s \le 0.001B$, false gain will not be a significant problem, even for
severely oversampled data.




Figure III-7. 7-Channel 3-Point Filter, Variable Convergence Parameter, Mean-Square-Error Vs